Testing Probability Distributions using Conditional Samples
نویسندگان
چکیده
We study a new framework for property testing of probability distributions, by considering distribution testing algorithms that have access to a conditional sampling oracle. This is an oracle that takes as input a subset S ⊆ [N ] of the domain [N ] of the unknown probability distribution D and returns a draw from the conditional probability distribution D restricted to S. This new model allows considerable flexibility in the design of distribution testing algorithms; in particular, testing algorithms in this model can be adaptive. We study a wide range of natural distribution testing problems in this new framework and some of its variants, giving both upper and lower bounds on query complexity. These problems include testing whether D is the uniform distribution U ; testing whether D = D∗ for an explicitly provided D∗; testing whether two unknown distributions D1 and D2 are equivalent; and estimating the variation distance between D and the uniform distribution. At a high level our main finding is that the new conditional sampling framework we consider is a powerful one: while all the problems mentioned above have Ω( √ N) sample complexity in the standard model (and in some cases the complexity must be almost linear in N), we give poly(logN, 1/ )-query algorithms (and in some cases poly(1/ )-query algorithms independent of N) for all these problems in our conditional sampling setting. ∗[email protected], Columbia University. Supported by NSF grants CCF-1115703 and CCF-1319788. †[email protected], Tel Aviv University. Supported by ISF grants 246/08 and 671/13. ‡[email protected], Columbia University. Supported by NSF grants CCF-0915929 and CCF-1115703.
منابع مشابه
Testing equivalence between distributions using conditional samples
We study a recently introduced framework [7, 8] for property testing of probability distributions, by considering distribution testing algorithms that have access to a conditional sampling oracle. This is an oracle that takes as input a subset S ⊆ [N ] of the domain [N ] of the unknown probability distribution D and returns a draw from the conditional probability distribution D restricted to S....
متن کاملProperty Testing of Joint Distributions using Conditional Samples
In this paper, we present the first non-trivial property tester for joint probability distributions in the recently introduced conditional sampling model. The conditional sampling framework provides an oracle for a distribution μ that takes as input a subset S of the domain Ω and returns a sample from the distribution μ conditioned on S.For a joint distribution of dimension n, we give a Õ(n3)-q...
متن کاملImproving and Extending the Testing of Distributions for Shape-Restricted Properties
Distribution testing deals with what information can be deduced about an unknown distribution over {1, . . . , n}, where the algorithm is only allowed to obtain a relatively small number of independent samples from the distribution. In the extended conditional sampling model, the algorithm is also allowed to obtain samples from the restriction of the original distribution on subsets of {1, . . ...
متن کاملTesting a Point Null Hypothesis against One-Sided for Non Regular and Exponential Families: The Reconcilability Condition to P-values and Posterior Probability
In this paper, the reconcilability between the P-value and the posterior probability in testing a point null hypothesis against the one-sided hypothesis is considered. Two essential families, non regular and exponential family of distributions, are studied. It was shown in a non regular family of distributions; in some cases, it is possible to find a prior distribution function under which P-va...
متن کاملA Conditional Test for Exponentiality Against Weibull DFR Alternatives Based on Censored Samples
A conditional test based on quadratic form using type-2 censored sample for testing exponentiality against Weibull alternative is proposed. The simulated percentage points and powers are given. The proposed test performs well for identifying Weibull DFR alternative even for small sample. An example is also given.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- SIAM J. Comput.
دوره 44 شماره
صفحات -
تاریخ انتشار 2012